2025
7 posts
01-30
【论文阅读笔记】OneLLM: One Framework to Align All Modalities with Language
01-25
3D-LLM: Injecting the 3D World into Large Language Models
01-25
3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding
01-25
Mind's Eye of LLMs: Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models
01-25
NExT-GPT: Any-to-Any Multimodal LLM
01-25
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces
01-17
【论文阅读笔记】MagicMap: Enhancing Indoor Navigation Experience in VR Museums
2024
15 posts
12-02
【论文阅读笔记】Map-Relative Pose Regression for Visual Re-Localization
11-24
【论文阅读笔记】Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
11-22
Scaling Data Generation in Vision-and-Language Navigation
11-02
【论文阅读笔记】Building Rome in a Day
09-22
【论文阅读笔记】DirectGPT: A Direct Manipulation Interface to Interact with Large Language Models
09-11
【论文阅读笔记】CodeAid: Evaluating a Classroom Deployment of an LLM-based Programming Assistant that Balances Student and Educator
09-11
【论文阅读笔记】CodeHelp: Using Large Language Models with Guardrails for Scalable Support in Programming Classes
09-11
【论文阅读笔记】Teach AI How to Code: Using Large Language Models as Teachable Agents for Programming Education
09-10
【论文阅读笔记】CodeTailor: LLM-Powered Personalized Parsons Puzzles for Engaging Support While Learning Programming
09-02
【论文阅读笔记】Coding with AI: How Are Tools Like ChatGPT Being Used by Students in Foundational Programming Courses
08-10
【论文阅读笔记】Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
08-10
【论文阅读笔记】Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Language Models
08-10
【论文阅读笔记】Graph of Thoughts: Solving Elaborate Problems with Large Language Models
08-08
【论文阅读笔记】Automatic Chain of Thought Prompting in Large Language Models
08-08
【论文阅读笔记】Multimodal Chain-of-Thought Reasoning in Language Models
